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Abstract 

Breast cancer, one of the most common malignancies diagnosed among women worldwide, is a complex polygenic disease 
in the etiology of which genetic factors play an important role. Thus far, a subset of breast cancer genetic susceptibility loci 
has been addressed among Asian woman through genome-wide association studies (GWASs). In this study, we identified 
numerous long, intergenic, noncoding RNAs (lincRNAs) enriched in these breast cancer risk-related loci and identified 16 
single nucleotide polymorphisms (SNPs) located within the sequences of lincRNA exonic regions. We examined whether 
these 16 SNPs are associated with breast cancer risk in 2539 cancer patients and 2818 control subjects from eastern, 
southern, and northern Chinese populations. We found that the C allele of the rsl 23254890T polymorphism in the exonic 
regions of lincRNA-ENST000005 15084 was associated with a significantly increased risk of breast cancer (adjusted odds ratio 
[OR] = 1.79; 95% confidence interval [CI] = 1.50-2.12), compared with the rs12325489TT genotype. Biochemical analysis 
demonstrated that the C to T base change at rs12325489C>T disrupts the binding site for miRNA-370, thereby influencing 
the transcriptional activity of lincRNA-ENST00000515084 in vitro and in vivo, and affecting cell proliferation and tumor 
growth. Our findings indicate that the rs12325489C>T polymorphism in the lincRNA-ENST000005 15084 exon may be a 
genetic modifier in the development of breast cancer. 
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Introduction 

As one of the most common malignant tumors in women 
throughout the world, the incidence and mortality of breast cancer 
has continued to rise in developing countries over the past decade 
[1,2]. Epidemiological surveys have demonstrated that unfavor- 
able environmental exposure, lifestyle habits, and genetic factors 
are closely related with the incidence of breast cancer [3-7]. 
However, only a small proportion of exposed individuals develop 
breast cancer, suggesting that genetic factors, such as genetic 
variants (GVs) plays a more important role in an individual's risk 
of diseases [8,9]. Much attention has been focused on the 
expression of protein-coding genes; however, accumulating 
evidence points to the essential role of non-protein-coding RNA 
(ncRNA) in cellular processes and aberrant ncRNA expression in 
disease phenotypes [10-12]. 

Transcriptome analysis has shown that the majority of human 
genome transcripts consist of ncRNA, including short non-coding 
RNAs (microRNAs), small interfering RNAs (siRNAs), piwi- 



interacting RNAs (piRNAs) and long ncRNAs (IncRNAs). 
However, to date, less is known about the role of the majority of 
noncoding transcripts, represented by IncRNAs, in development 
and in cancer biology. Long intergenic non-coding RNA 
(lincRNAs) are 100-200 nucleotides or longer transcript units 
that are interspersed between known protein-coding loci, but do 
not encode proteins [13,14]. Recently, several studies have shown 
that groups of lincRNAs, correlated with various biological 
processes, including embryonic stem cell pluripotency [15,16], 
gene transcription [17,18] and chromatin-remodeling [19]. 
Furthermore, emerging evidence of dysregulated lincRNA expres- 
sion in numerous cancers is increasing, suggesting a major role for 
involvement of lincRNA in human tumorigenesis and metastasis 
[20,21]. A previously uncharacterized RNA gene, named 
PTCSC3, which acts as a tumor-suppressor, is involved in the 
predisposition to papillary thyroid carcinoma [10]. In addition, 
another lincRNA, MALAT1, was associated with a variety of 
human cancers of the breast, prostate and colon [22-25]. Yet, the 
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molecular mechanism underlying the specific role of lincRNAs in 
cancer development has remained unclear. 

Over the past few years, advances in genome-wide association 
studies (GWASs) have enabled the capture of thousands of genetic 
variants related to diseases and/or traits by high throughput 
technologies; however, at least one-third of the identified variants 
are within non-coding intervals [26]. Recently, several GWAS 
studies conducted among Asian women have reported several 
susceptibility risk loci that are associated with breast cancer risk 
[27-30]. Among these were numerous lincRNAs. Several relevant 
polymorphic sites located in the exonic regions of lincRNAs that 
may associate with breast cancer were identified; however, the 
association between genetic variations in lincRNAs exons and 
cancer susceptibility has rarely been reported. 

In this paper, we hypothesized that genetic variations in the 
exonic region of lincRNAs may affect the functions of lincRNAs 
and thereby may contribute to breast cancer. To test this 
hypothesis, we conducted a hospital-based case-control study to 
investigate the association between genetic polymorphisms in the 
exonic regions of lincRNAs and susceptibility to breast cancer in a 
Chinese population. 

Materials and Methods 

Study Subjects 

All subjects in the current study were ethnically homogenous 
Han Chinese women, and included 2539 breast cancer patients 
and 2818 healthy controls derived from eastern, southern, and 
northern Chinese populations. In the eastern Chinese population, 
patients (n = 1 006) with diagnosed breast cancer were recruited 
consecutively from 2001 to 2009 from the First Affiliate Hospital 
of Soochow University (Suzhou, China), and were from Suzhou 
city and its surrounding regions, with a response proportion of 
89%. There were no age, stage, or histology restrictions. To 
facilitate the study of breast cancer genetic susceptibility in a 
homogeneous population, 1018 cancer-free women were selected 
from a database consisting of 3,500 individuals based on a physical 
examination conducted in Jiangsu Province during the same time 
period that the cases were collected [31]. In the southern Chinese 
population, breast cancer cases (n — 772) were recruited from the 
Tumor Hospitals affiliated to Guangzhou Medical College 
between 2002 and 2009, with a response rate of 91%. Controls 
(n = 820) were selected from a pool of 5000 individuals who 
participated in a community-based screening program involving a 
health checkup conducted in Guangdong province during the 
same time period that the cases were recruited [32]. In the 
northern Chinese population, patients (n = 761) with diagnosed 
breast cancer were recruited from the Third Affiliate Hospital of 
Harbin Medical University between 2005 and 2010, and were 
from Harbin city and its surrounding regions. There were no age, 
stage, or histology restrictions. Similar to the southern Chinese 
population, controls (n = 980) were selected from a pool of 6000 
individuals who had participated in a community-based screening 
program involving a health checkup [33]. Furthermore, the 
selection criteria for all controls were that they had no history of 
cancer, and were frequency matched to each set of cases on age 
(±5 years), with a response proportion of 90%. The tumor, node, 
metastasis (TNM) classification and tumor staging were evaluated 
according to the 2002 American Joint Committee on Cancer 
Staging system. At recruitment, after giving a written informed 
consent, each subject was scheduled for an interview with a 
structured questionnaire to provide data on age, body mass index, 
family history of cancer and other factors, and none had blood 
transfusion in the last 6 months to provide a blood sample for 



DNA analysis. The study was approved by the medical ethics 
committee of Soochow University, and the institutional review 
boards of Guangzhou Medical University and the Harbin Medical 
University. Clinical features of the patients are summarized in 
Table 1. 

SNP Selection 

We searched for all published literature investigating an 
association between genetic susceptibility and breast cancer risk 
up to April 2013 using the PubMed database and Web of Science. 
Relevant search terms were "genome-wide association study", 
"GWAS", "breast cancer", "breast carcinoma", "breast neo- 
plasms", and "Asian". We also searched the reference lists in 
selected articles. We firsdy excluded some articles by scanning the 
tides and abstracts of studies. Then, after reading the full text of 
the remaining articles, we identified a final set of studies. The 
papers included in this set had to meet the following criteria: (1) 
the outcome investigated was based on GWAS in relation to breast 
cancer in humans; (2) the articles were published in English; (3) the 
latest studies were selected among overlapping data and duplicated 
data; (4) GWAS was conducted using chip technology. In the 
obtained 4 GWAS articles, only significant associations between 
genetic susceptibility loci and breast cancer risk were included. 
Finally, using this search strategy, 4 common genetic susceptibility 
loci (6q25.1, 16ql2.1, 10q21.2, llq24.3) that were independendy 
associated with breast cancer risk among Asian women in GWASs 
were identified. The selected studies are shown in Table SI. 
Using bioinformatics analysis software programs (HapMap Data 
Release 27 Phase II+III, February 2009, on NCBI B36 assembly, 
dbSNP b 1 26 and UCSC Genome Browser-hgl8 assembly), we 
identified eight lincRNAs that did not overlap with any recognized 
genes. Two lincRNAs located at the genetic susceptibility loci 
6q25.1(chr6:149.100.001-152.600.000), four lincRNAs at the loci 
16ql2.1(chrl6:45. 500.001-51. 200.000), and two lincRNAs at the 
loci llq24.3 (chrl 1:127.400.001-130.300.000) from the human 
lincRNAs database from a leading work of Ulitsky et al. [34] , in 
which the authors combined long (>200 bp) noncoding transcripts 
from Ensembl, RefSeq, UCSC genes and obtained sets of 2,458 
human lincRNAs. None of lincRNAs located at the loci 10q21.2 
(chrl0:61,200,001-64,800,000). Furthermore, Using the dbSNP 
database (http://www.ncbi.nlm.nih.gov/), bioinformatics analysis 
software program (Haploview software 4.2) and the criteria of minor 
allele frequencies of greater than 5% in the Chinese population, 
1 6 SNPs located on exons of the selected 8 lincRNAs were finally 
extracted (6q25. 1 : rs227290 1C>T, rs532367A>G, rs603092G>A; 
16ql2.1: rsl7841343G>A, rs4785367T>C, rs2278016T>C, 
rs3815784C>T, rsl362378A>C, rsl2325489C>T, rs3803662T 
>C; llq24.3: rsl0750417G>A, rsl08941 15C>G, rsl814343T> 
C, rsl814344T>C, rs4937447C>T, rsl08941 16G>A) (Table 2). 

Genotyping Analysis 

Genome DNA was extracted from peripheral blood lympho- 
cytes of the study subjects. Allele-specific MALDI-TOF mass 
spectrometry was used to genotype the markers used in the 
association analyses, as previously described [7] ; genotyping of all 
markers was performed by MassArray (Sequenom, SanDiego, 
CA). All breast cancer patients and healthy controls in Suzhou 
center were geno typed for the 16 polymorphisms; patients and 
controls from Guangzhou and Harbin centers were genotyped 
only for the polymorphism rs 12325489 to verify the results 
obtained in the Suzhou population. The genotyping results were 
further confirmed by direct sequencing. 
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Table 2. Summary of results for selected SNPs showed associations with risk of breast cancer. 



Loci 


Chr:position a 


LincRNA:position" 


MAF (case, control) 


OR (95%CI) b 


P value 


Eastern Chinese (Suzhou) 


rs12325489(C>T) 


16q:50864577 


ENST00000515084 exon 1 


T = 0.29, 0.38 


1.49 (1.31- 


1.68) 


<1.0x10 7 


rs2272901 (OT) 


6q:149581466 


uc003qmi.2 exon 2 


T = 0.41, 0.41 


1 .09 (0.96- 


1.25) 


0.182 


rs532367(A>G) 


6q: 149584491 


uc003qmi.2 exon 3 


G = 0.45, 0.46 


1 .02 (0.90- 


1.16) 


0.679 


rs603092(G>A) 


6q: 149584784 


uc003qmi.2 exon 3 


A = 0.13, 0.14 


1.12 (0.93- 


1.36) 


0.469 


rs17841343(G>A) 


16q:48512623 


ENST00000506071 exon 1 


A = 0.21, 0.21 


0.98 (0.84- 


1.15) 


0.915 


rs4785367(T>C) 


16q:48513695 


ENST00000506071 exon 1 


C = 0.28, 0.30 


1.11 (0.96- 


1.28) 


0.071 


rs2278016(T>C) 


16q:50674510 


uc002egu.2 exon 1 


C = 0.13, 0.13 


0.97 (0.80- 


1.18) 


0.882 


rs3815784(OT) 


16q:50675154 


uc002egu.2 exon 1 


T = 0.07, 0.06 


0.78 (0.60- 


1.02) 


0.066 


rs1362378(A>C) 


16q:50863415 


ENST000005 15084 exon 1 


C = 0.45, 0.45 


1.01 (0.89- 


1.15) 


0.943 


rs3803662(T>C) 


16q:5 1143842 


NR_033920 exon 1 


C = 0.35, 0.35 


0.96 (0.83- 


1.10) 


0.913 


rsl 075041 7(G> A) 


11q:1 28987385 


ENST000005 11090 exon 1 


A = 0.21, 0.20 


0.92 (0.79- 


1.09) 


0.313 


rsl 08941 15(C>G) 


11q:1 29070287 


uc001qfd.1 exon 1 


G = 0.23, 0.22 


0.91 (0.78- 


1.06) 


0.148 


rs1814343(T>C) 


11q:1 29070462 


uc001qfd.1 exon 1 


C = 0.39, 0.39 


1 .00 (0.88- 


1.15) 


0.915 


rs1814344(T>C) 


11q:1 29070626 


uc001qfd.1 exon 1 


C = 0.24, 0.23 


0.98 (0.84- 


1.14) 


0.513 


rs4937447(OT) 


11q:1 29070761 


uc001qfd.1 exon 1 


T = 0.47, 0.45 


0.88 (0.77- 


1.01) 


0.146 


rs10894116(G>A) 


11q:1 29072920 


ucOOIqfd.1 exon 1 


A = 0.16, 0.18 


1.09 (0.91- 


1.29) 


0.358 


Southern Chinese (Guangzhou) 


rs12325489(OT) 


16q:50864577 


ENST00000515084 exon 1 


T = 0.29, 0.39 


1.52 (1.31- 


1.79) 


<1.0x10 7 


Northern Chinese (Harbin) 


rs12325489(OT) 


16q:50864577 


ENST00000515084 exon 1 


T = 0.26, 0.36 


1.58 (1.34- 


1.82) 


<1.0x10 7 


Pooled analysis 


rs12325489(OT) 


16q:50864577 


ENST00000515084 exon 1 


T = 0.28, 0.37 


1.51 (1.37- 


1.68) 


<1.0x10 7 



a Data of position based on the International HapMap project (Rel 27 Phasell+lll, Feb09, on NCBI B36 assembly, dbSNP 
b Data were calculated by unconditional logistic regression, adjusted for age, BMI, and family history of cancer. 
doi:1 0.1 371/journal.pone.0098251 .t002 



b126). 



Animals and Cell Culture 

Twenty female BALB/ c nude mice that were 4-5 weeks of age 
and each weight (20±2)g were purchased from the Shanghai 
Laboratory Animal Center at the Chinese Academy of Sciences 
(Shanghai, China). The mice were allowed to acclimate to local 
conditions for at least 1 week and were maintained on a 1 2 h light- 
dark cycle with food and water ad libitum. This study was carried 
out in strict accordance with the recommendations in the Guide 
for the Care and Use of Laboratory Animals of the National 
Institutes of Health. The protocol was approved by the Committee 
on the Ethics of Animal Experiments of the Laboratory Animal 
Center of Soochow University (Permit Number: SYXK(- 
SHU20120045)). Human breast cancer cell lines (MCF-7 and 
Bcap-37) were purchased from the Cell Bank of Type Culture 
Collection of the Chinese Academy of Sciences, Shanghai Institute 
of Cell Biology, and were passaged for fewer than 6 months. MCF- 
7 and Bcap-37 cells were cultured in RPMI 1640 medium (Gibco- 
BRL, Gaithersburg, MD, USA) supplemented with 10% fetal 
bovine serum (Gibco-BRL, Gaithersburg, MD, USA) and 
1 xantibiotics/antimycotics, at 37°C in the presence of 5% C0 2 . 

In-silico Prediction of Folding Structures Induced by 
rs12325489C>T in lincRNA-ENST00000515084 

It is plausible that certain structures are more likely to play key 
roles in biological functions; thus, structural rearrangement may 
influence the expression and functions of genes by affecting its 
folding structures. We used RNAfold [35] and SNPfold algorithms 



[36,37] to predict the putative influence of rsl2325489C>T on 
the local folding structures of lincRNA-ENST000005 15084 by 
analyzing the 61 -bp regions flanking the polymorphism. 

Subcellular Fractionation 

Cells from 2 different breast cancer cell lines, namely, Bcap-37 
and MCF-7, were cultured in a humidified incubator for 2 days. 
For subcellular fractionation experiments, up to 2xl0 6 cells were 
used. Cytosolic and nuclear extracts from breast cancer cells were 
collected using a Nuclear/Cytosol Fractionation kit (Biovision, 
USA) according to the manufacturer's instructions. Briefly, Bcap- 
37 and MCF-7 cells were lysed with a buffer containing 10 mM 
Tris-HCl (pH = 7.4), 100 mM NaCl, 2.5 mM MgC12, and 
40 mg/ ml digitonin for 1 0 min. The resulting lysates centrifuged 
with 2,060 xg for 10 min at 4°C. The supernatant was used for the 
cytosolic fraction. Subsequently, the pellets were washed and 
incubated with RIPA buffer at 4°C for 10 min. After centrifuga- 
tion at 4°C for 10 min at 2,060 xg, the nuclear fraction was 
collected. 

Construction of Reporter Plasmids 

C-allelic reporter constructs were prepared by amplifying the 
lincRNA exonic region spanning the 258 bp flanking the 
rsl 2325489 polymorphism from subjects homozygous for the C 
aUele (rsl2325489CC) with the forward primer 5'- 
CCGCTCGAGCCATTGGTAAGAAGCA-3 ' and the reverse 
primer 5'-ATTTGCGGCCGCCTTTGAATAGGGAAGAAC- 
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Table 3. Rs12325489C>T genotype among patients and controls and their association with risk of breast cancer. 





Genotypes 


Cases 


Controls 


OR (95%CI) a 


P value 




n (%) 


n (%) 






Eastern Chinese (Suzhou) 


n = 1006 


n = 1018 






rs!2325489C>T 


TT 


92 (9.1) 


146 (14.3) 


1.00 (reference) 


<1.0x10 5 


CT 


398 (39.6) 


474 (46.6) 


1.32 (0.97-1.80) 




CC 


516 (51.3) 


398 (39.1) 


2.08 (1.53-2.82) 




CT+CC 


914 (90.9) 


872 (85.7) 


1.67 (1.25-2.23) 




Southern Chinese (Guangzhou) 


n = 772 


n = 820 






rsl2325489C>T 


TT 


73 (9.5) 


133 (16.2) 


1.00 (reference) 


<1.0x10 5 


CT 


308 (39.9) 


372 (45.4) 


1.46 (1.05-2.03) 




CC 


391 (50.6) 


315 (38.4) 


2.20 (1.59-3.06) 




CT+CC 


699 (90.5) 


687 (83.8) 


1.80 (1.32-2.45) 




Northern Chinese (Harbin) 


n = 761 


n = 980 






rs12325489C>T 


TT 


53 (7.0) 


133 (13.6) 


1.00 (reference) 


<1.0x10 5 


CT 


290 (38.1) 


438 (44.7) 


1.67 (1.17-2.38) 




CC 


418 (54.9) 


409 (41.7) 


2.52 (1.84-3.71) 




CT+CC 


708 (93.0) 


847 (86.4) 


2.12 (1.52-2.97) 




Pooled analysis 


n = 2S39 


n = 2818 






rs!2325489C>T 


TT 


218 (8.6) 


412 (14.6) 


1.00 (reference) 


<1.0x10 7 


CT 


996 (39.2) 


1284 (45.6) 


1.43 (1.18-1.72) 




CC 


1325 (52.2) 


1 1 22 (39.8) 


2.20 (1.83-2.66) 




CT+CC 


2321 (91.4) 


2406 (85.4) 


1.79 (1.50-2.12) 





a Data were calculated by logistic regression analysis with adjusted for age, BMI, and family history of cancer. 
doi:1 0.1 371 /joumal.pone.0098251 .t003 



3', which included Xhol and Notl (Fermentas, Hanover, MD, USA) 
restriction enzyme sites, and cloning these fragments into 
psiCHECK-2 (Promega, Madison, WI, USA). A Quick Change 
XL site-directed mutagenesis kit (Stratagene, La Jolla, CA) was 
used to obtain rsl2325489T reporter constructs from the 
psiCHECK-2-rsl2325489C constructs by site-directed mutagen- 
esis as described previously [38]. The amplified exonic regions 
including the rsl2325489C>T polymorphism were inserted into 
the Xhol and jVofl enzyme sites of the 3'-UTR of the Renilla 
luciferase gene in the vector psiCHECK-2. Finally, the resulting 
constructs (psiCHECK-2-rsl2325489T and psiCHECK-2- 
rsl2325489G) were sequenced to confirm the allele, orientation, 
and integrity of each insert. 

Transient Transfections and Luciferase Assays 

Bcap-37 and MCF-7 cells were seeded in 24-well plates (1 xlO 5 
cells per well) and cultured to 60-70% confluence before 
transfection; cells were then transfected with the reporter plasmids 
described above using Lipofectamine 2000 (Invitrogen, CA, USA). 
In each well, co-transfection was performed using 800 ng of 
constructed plasmid DNA and 0, 1, or 40 pmol miRNA-370 
mimics (Shanghai GenePharma Co., Ltd.), and with or without 
40 pmol miRNA-370 inhibitor, according to the manufacturer's 



instructions. Additionally, for every miRNA transfection experi- 
ment, 100 pmol of non-specific miRNA (GenePharma Co., Ltd.) 
was used as a negative control. After transfection for 24 h, 100 uL 
luciferase assay reagent was added to assay the cells. Luciferase 
activity was measured with the Dual-Luciferase Reporter assay 
system (Promega, Madison, WI, USA) using a TD-20/20 
luminometer (Turner Biosystems, Sunnyvale, CA, USA) according 
to the manufacturer's instructions, and the results were normalized 
against the activity of the Renilla luciferase gene. Each group 
included 6 replicates, and independent triplicate experiments were 
performed. 

Real-time PCR Analysis 

Thirty-nine breast cancer tissue specimens were obtained from 
biopsies of patients and were stored in liquid nitrogen before 
analysis. Each subject signed a written consent approved by the 
medical ethics committee of Soochow University. Total RNA was 
obtained from these cancerous tissues with TRIzol reagent 
(Molecular Research Center, Inc). According to the manufactur- 
er's protocol, cDNA was generated from mRNA using the random 
primer and Superscript II (Invitrogen). Real-time quantitative 
polymerase chain reaction (PCR) was carried out to quantify the 
relative gene expression of lincRNA-ENST000005 15084, using an 
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ABI Prism 7500 sequence detection system (Applied Biosystems) 
based on the SYBR-green method. All quantifications were 
performed using GAPDH as an internal reference gene. The 
primers used for PCR amplification of the lincRNA- 
ENST000005 15084 cDNA were 5'-CTCTTCCATCTGGCT- 
GTTTCTAA-3' (F) and S'-GCTTCAAATGTTGATG- 
GAAAGTC-3' (R). For GAPDH, the primers used were 5'- 
GAAGGTGAAGGTCGGAGTC-3' (F) and 5 ' -GAAGATGGT- 
GATGGGATTTC-3' (R). The lincRNA-ENST000005 15084 
expression measurements in individuals were normalized against 
the expression of GAPDH, using a modified method. In addition, 
we subsequendy examined the relative level of lincRNA- 
ENST000005 15084 in both the nucleus and cytoplasm of breast 
cancer cell lines. The All-in-One TM miRNA qRT-PCR 
Detection kit (GeneCopoeia, Rockville, MD, USA) was used to 
detect basal miRNA-370 expression in breast cancer tissues, 
according to the manufacturer's protocol. The expression of 
miRNA-370 was calculated relative to the U6 small nuclear RNA. 
All analyses were performed in a blinded fashion in which the 
laboratory personnel were unaware of the genotyping data. 

Actinomycin D Assay 

Bcap-37 and MCF-7 cells were plated in 24-well culture plates 
(5xl0 4 cells per well). Sixteen hours after plating, the cells were 
transfected with 0, 1, or 40 pmol miRNA-370 mimics or with 
miRNA-370 inhibitor (Shanghai GenePharma Co., Ltd.). Twenty- 
four hours after transfection, cells were incubated with actinomy- 
cin D (Sigma) for 1, 2, or 3 h as previously described [39]. 
Actinomycin D was used at a final concentration of 2 ug/mL. Six 
replicates for each group and the experiment were repeated at 
least 3 times. 

Cell Viability Assay 

In 96-well, flat-bottomed plates (BD Biosciences, Bedford, MA), 
100 uL MCF-7 and Bcap-37 cells suspension (10,000 cells per mL) 
were aliquoted into each well. After transfection as described 
above for the actinomycin D assay and 1, 2 and 3 days of 
cultivation, cell viability was measured using cell viability was 
measured by Cell Counting Kit-8 (CCK-8) system (Dojindo 
Laboratory, Kumamoto, Japan) according to the manufacturer's 
instructions. Briefly, 10 ul CCK-8 solution of each well was added, 
the plates were then incubated at 37°C for 1 h, and the 
absorbance of each well was read at 450 nm using a microplate 
reader (MRX; Dynex Technologies, West Sussex, United King- 
dom). Six replicates for each group and the experiment were 
repeated at least 3 times. 

Lentiviral Production and Transduction 

The precursor sequence of miRNA-370 was synthesized by the 
Genewiz Company (Suzhou, China) and then cloned into the 
lentiviral expression vector pLVX-IRES-neo (Clontech Laborato- 
ries Inc., San Francisco, CA, USA). The resulting construct 
(pLVX-IRES-neo-miRNA-370) was verified by direct sequencing. 
Using a 3-plasmid transient co-transfection method (Lenti-T HT 
Packaging Mix, Clontech), replication-defective VSV-G pseudo- 
typed viral particles were packaged in human embryonic kidney 
cells LentiX 293T (Clontech Laboratories Inc.). The virus was 
then harvested and filtered. For transduction, MCF-7 and Bcap-37 
cells were infected with the control lentivirus (empty vector 
containing only lentivirus without the miRNA-370 fragment) or 
miRNA-370 lentivirus. After 48 h of transduction, the cells were 
stably selected with G418 (500 ug/mL), and the drug-resistant cell 
populations were used for subsequent studies. 



Xenografts in Mice 

Cells harboring the MCF-7-empty vector, MCF-7-miRNA-370, 
Bcap-37-empty vector, or Bcap-37-miRNA-370 were diluted to a 
concentration of 5xl0 6 cells per mL in physiological saline, and 
0.1 mL of the suspension was injected subcutaneously into the 
posterior flank of mice. Five nude mice were used for each group. 
When a tumor was palpable, tumor growth was measured every 
other day following subcutaneous injection of tumor cells on one 
or both sides of the back of syngeneic mice, by measuring the 2 
largest perpendicular diameters with calipers, and tumor volume 
was calculated according to the following formula: V = L xW 2 x0.5 
(L, length; W, width). 

Statistical Analysis 

The differences in the distributions of age, menstrual history, 
BMI, and family history of cancer between cases and cancer-free 
controls, as well as the allele and genotype frequencies were 
assessed by two-sided chi-squared tests. Unconditional logistic 
regression models was used to assess the association between the 
risk of breast cancer and each SNP by odds ratios (ORs) and their 
95% confidence intervals (95%CIs), with adjustments for age, 
BMI, and family history. Logistic regression modeling was used in 
the trend test, as well as to evaluate the potential multiplicative and 
additive gene-gene and gene-environmental factor interactions. 
Furthermore, the data were further stratified by age, age at 
menarche (years), menstrual history, BMI, and family history of 
cancer to evaluate the stratum variable-related ORs among various 
lincRNA-ENST000005 15084 rsl2325489C>T genotypes, which 
were further tested for homogeneity within the strata. Statistical 
power was computed by applying the PS software (http://bio.stat. 
mc.vanderbilt.edu/twiki/bin/ view/Main/PowerSampleSize, ac- 
cessed Dec 14, 2010). Student's t-test was used to compare the 
differences in the levels of luciferase reporter gene expression. 
Kruskal-Wallis one-way ANOVA test and linear regression models 
were used to evaluate the effect of various SNPs on the lincRNA- 
ENST000005 15084 transcript expression. All tests were two-sided 
by using the SAS software (version 9.1; SAS Institute, Cary, NC, 
USA). A P<0.05 was used as the criterion for statistical significance. 

Results 

Genotypes and Risk of Breast Cancer 

In the discovery set analysis of these sixteen SNPs genotyped in 
the eastern Chinese population, a significant association with 
breast cancer risk was observed for rsl2325489C>T but not for 
the other (Table 2). Specifically, the adjusted ORs associated with 
the lincRNA-ENST000005 15084 rsl2325489CC and CT geno- 
type were 2.08 (95%CI = 1.53-2.82) and 1.32 (95%CI = 0.97- 
1.80), when compared with the rsl2325489TT genotype. These 
associations were confirmed in the southern and northern Chinese 
population, where the adjusted ORs of carrying the 
rsl2325489CC and CT genotype were 2.20 (95%CI= 1.59- 
3.06) and 1.46 (95%CI= 1.05-2.03) in the southern Chinese 
population and 2.52 (95%CI = 1.84-3.71) and 1.67 
(95%CI= 1.17-2.38) in the northern Chinese population, respec- 
tively, when compared with the rsl2325489TT genotype 
(Table 3). 

A stratified analysis assessing the associations between the 
lincRNA-ENST000005 15084 rsl2325489C>T genotypes and 
the risk of breast cancer was conducted. However, there was no 
significant association in any of the subgroups (Table 4). 
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Figure 1. Cellular and molecular characterization of lincRNA-ENST00000515084. (A) The levels of nuclear control transcript (U6), 
cytoplasmic control transcript (GAPDH mRNA), and lincRNA-ENST000005 15084 were assessed by RT-qPCR in nuclear and cytoplasmic fractions. Data 
are mean±standard error of the mean. Data are presented as a percentage of U6, GAPDH and lincRNA-ENST00000515084 levels and total levels for 
each were taken to be 100%. (B) In-silico prediction of folding structures induced by rs12325489C>T in lincRNA-ENST00000515084. The mountain 
plot is an xy-graph that represents a secondary structure including WIFE structure, the thermodynamic ensemble of RNA structures (pf), and the 
centroid structure in a plot of height versus position, "mfe" represents minimum free energy structure; "pf" indicates partition function; "centroid" 
represents the centroid structure. 
doi:10.1371/journal.pone.0098251.g001 
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Figure 2. LincRNA-ENST0000051 5084 rs12325489C>T genotypes affect lincRNA-ENST0000051 5084 expression by regulating the 
miRNA-370 binding in vitro. (A) Reporter plasmids construction. Representative graph of luciferase activity of variant allele on luciferase reporter 
genes bearing the lincRNA-ENST00000515084 exonic region spanning 258 bp flanking the rs12325489C>T polymorphism segments in Bcap-37 (B) 
and MCF-7 cells (C). Relative luciferase activity of the psiCHECK-2-rs12325489T and psiCHECK-2-rs12325489C constructs co-transfected with miRNA- 
370 and inhibitor in Bcap-37 and MCF-7 cells (D). Renilla luciferase activity was measured and normalized to firefly luciferase. Six replicates were 
carried out for each group, and the experiment was repeated at least three times. Data are mean±standard error of the mean. Asterisk indicates a 
significant change (P<0.01). (E) LincRNA-ENST00000515084 expression level in thirty-nine breast cancer tissues for different rs12325489C>T 
genotypes (19 rs12325489CC, 15 rs12325489CT and 5 rs12325489TT); data are mean±standard error of the mean. lincRNA-ENST00000515084 
expression normalized to GAPDH (P = 0.026 for lincRNA-ENST00000515084). 
doi:1 0.1 371 /journal.pone.0098251 .g002 
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Figure 3. Effects of rs12325489C>T genotypes on cells' proliferation and tumor growth. (A) The different steady-state of lincRNA- 
ENST000005 15084 in rs12325489TT (Bcap-37) cells and rs12325489CC (MCF-7) cells. Twenty-four hours after transfecting with miRNA-370 mimics or 
miRNA-370 inhibitor, rsl 2325489TT (Bcap-37) cells and rs1 2325489CC (MCF-7) cells were incubated with actinomycin D for 1 , 2 or 3 hours. The decline 
transcript levels of lincRNA-ENST00000515084 and GAPDH mRNA levels after actinomycin D treatment was measured by real-time PCR analysis; data 
are mean±standard error of the mean, normalized to GAPDH. *P<0.05 compared with control. (B) Cells' proliferation rate was significantly inhibited 
in rsl 2325489CC (MCF-7) cells by binding miRNA-370. Cell proliferation was performed by the cell viability assay. Six replicates for each group and the 
experiment repeated at least three times. Data are mean±standard error of the mean (P = 0.001). *P<0.05 compared with control. (C) Effects of 
rs12325489C>T genotypes on tumor growth by binding miRNA-370. Figure shown is a representative of five nude mice for each group at the third 
week after xenograft-transplanted nude mouse tumor models of human breast cancer growth established. *P<0.05 compared with control. 
doi:1 0.1 371 /journal.pone.0098251 .g003 
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Cellular Characterization of lincRNA-ENST000005 15084 

The levels of nuclear control transcript (U6), cytoplasmic control 
transcript (GAPDH mRNA), and lincRNA-ENST000005 1 5084 
were assessed by RT-qPCR in nuclear and cytoplasmic fractions, 
respectively. The results showed that GAPDH mRNA was 
exclusively detected in the cytoplasmic fraction, while nucleus- 
retained U6 was predominandy found in the nuclear fraction. 
Further RT-qPCR analysis revealed that lincRNA- 
ENST000005 15084 expression was predominantly cytoplasmic 
(Figure 1A). 

In-silico Analysis of the Effect of rs12325489C>T on 
lincRNA-ENST00000515084 Folding 

Using RNAfold and SNPfold algorithms in in-silico analysis, we 
predicted local structural changes of lincRNA-ENST000005 15084 
caused by the rsl2325489C>T polymorphism located within the 
exonic region of lincRNA-ENST000005 15084. As shown in 
Figure IB, the results suggested that the C to T base change of 
rsl2325489C>T affects the folding of lincRNA- 
ENST000005 15084, which may affect the binding site for the 
microRNAs. This may then influence the transcriptional activity 
of the lincRNA-ENST000005 15084 gene. 

Rs12325489C>T Genotypes Affect lincRNA- 
ENST00000515084 Expression by Inhibiting the Binding 
of miRNA-370 in vitro 

We used bioinformatics analysis programs (http://snpinfo. 
niehs.nih.gov/snpinfo/snpfunc.htm) to predict a binding site for 
human microRNA within the lincRNA-ENST000005 1 5084 
region that contains the rsl2325489C>T polymorphism 
(Figure 2A). And the result showed that only miRNA-370 binds 
to lincRNA-ENST000005 15084 transcripts containing the 
rsl2325489C allele, while there are 4 microRNAs (miRNA- 
1229, miRNA- 1260b, miRNA-617, miRNA-1260) that bind to 
lincRNA-ENST000005 15084 transcripts containing the 
rsl2325489T allele. The sequence of the predicted 5 microRNAs 
binding sites were presented in Table S2. These 5 microRNAs 
and either of the polymorphic reporters constructs were transiendy 
co-transfected into Bcap-37 cells repectively, and luciferase activity 
subsequently assessed. The cells transiendy co-transfected miRNA- 
370 mimics and construct containing the rsl2325489G allele 
exhibited significandy reduced luciferase activity, in a concentra- 
tion-dependent manner, compared with the construct containing the 
rsl2325489T allele (Figure 2B). The same result was also observed 
when these experiments were repeated using MCF-7 cells 
(Figure 2C), but the other 4 microRNAs did not reveal significandy 
luciferase activity (Figure SI). The reporter vectors (psiCHECK-2- 
rsl2325489C and psiCHECK-2-rsl2325489T), miRNA-370 mim- 
ics, and miRNA-370 inhibitor were transiently co-transfected into 
Bcap-37 and MCF-7 cells. In both cell lines, no significant differences 
in luciferase activity were observed in the presence of the miRNA-370 
inhibitor (Figure 2D). These results suggest that miRNA-370 can 
bind and negatively regulate the transcription of lincRNA- 
ENST000005 15084 in the presence of rsl2325489C allele. 

Association of lincRNA-ENST000005 15084 rs12325489C> 
T Genotypes with lincRNA-ENST00000515084 Expression 

We performed RT-qPCR to further evaluate the effects of 
rsl2325489C>T on lincRNA-ENST000005 1 5084 expression 
using 39 breast cancer tumor tissues with different genotypes. As 
shown in Figure 2E, patients with the rsl2325489TT genotype 
expressed significandy higher lincRNA-ENST000005 1 5084 
mRNA levels (mean±SEM: 0.032±0.010), compared to carriers 



of the rsl2325489CT (0.018±0.006) and rsl2325489CC geno- 
types (0.008±0.003; ANOVA test: P= 0.026). And miRNA-370 is 
constitutively expressed in the 39 breast cancer tumor tissues; 
however, there was no significant association between the 
background expression of miRNA-370 and rsl2325489C>T 
genotypes (0.026±0.003 for CC; 0.021 ±0.007 for CT and 
0.026±0.008 for TT; ANOVA test: P= 0.810; linear regression 
test: P=0.137) (Figure S2). 

Effects of the lincRNA-ENST00000515084 rs12325489C>T 
Genotypes on Cell Proliferation 

DNA sequencing showed that MCF-7 and Bcap-37 cells harbor 
rsl2325489CC and rsl2325489TT genotypes, respectively. We 
further investigated the effect of lincRNA-ENST000005 1 5084 
rsl2325489C>T genotypes on cell proliferation in vitro. Twenty- 
four hours after transfecting MCF-7 and Bcap-37 cells with 
miRNA-370 mimics or with miRNA-370 inhibitor, as shown in 
Figure 3A, the transcript levels of the lincRNA- 
ENST000005 15084 rsl2325489CC genotype (MCF-7 cells) was 
down-regulated from 100% to 85% ±3. 2% after RNA synthesis 
was blocked with actinomycin D for 1 h in the presence of 1 pmol 
miRNA-370. Furthermore, the remaining level of lincRNA- 
ENST000005 1 5084 after exposure of cells to actinomycin D for 
1 h in the presence of 40 pmol miRNA-370 was only 27%±3.6%. 
However, these treatments resulted in no significant change in 
transcript levels of lincRNA-ENST000005 15084 with the 
rsl2325489TT genotype in Bcap-37 cells (P= 0.623). In addition, 
cell viability assays revealed that cell proliferation rate was 
significandy inhibited in cells carrying the rsl2325489CC 
genotype that had been transfected with miRNA-370 
(P= 0.001). However, there were no noteworthy changes in cell 
growth rate in cells with the rsl2325489TT genotype (P= 0.739) 
(Figure 3B). 

Effects of rs12325489C>T Genotypes on Tumor Growth 
by Binding to miRNA-370 

We evaluated the effects of rsl2325489C>T genotypes on 
tumor growth in xenografts model. Mice were injected with cells 
harboring the rsl2325489CC (MCF-7)-empty vector, 
rsl2325489CC (MCF-7)-miRNA-370, rsl2325489TT (Bcap-37)- 
empty vector, or rsl2325489TT (Bcap-37)-miRNA-370. As shown 
in Figure 3C, tumor growth from rsl2325489CC (MCF-7)- 
miRNA-370 cell xenografts was delayed by 5 days compared to 
that of tumors formed from rsl2325489CC (MCF-7)-empty vector 
cell xenografts, and the mean tumor volume after 3 weeks in the 
former group was about 424±69 mm , approximately 391 mm 
smaller than those resulting from rsl2325489CC (MCF-7)-empty 
cell xenografts. Growth of tumors from rsl2325489TT (Bcap-37)- 
miRNA-370 cell xenografts was not inhibited when compared 
with that of tumors formed from rsl2325489CC cells. 

We further used 2 additional models, using rsl2325489CC 
(MCF-7) and rsl2325489TT (Bcap-37) wild-type cells, to inves- 
tigate the general validity of our findings. The result showed that 
tumor growth was comparable to that of tumors formed from 
rsl2325489CC (MCF-7)-empty vector and rsl2325489TT (Bcap- 
37)-empty vector cell xenografts. 

Discussion 

In the hospital-based case-control study containing 2539 breast 
cancer patients and 2818 controls, we found that the risk of breast 
cancer was significantly associated with the rsl2325489C allele, 
which itself changes miRNA-370-mediated lincRNA- 
ENST000005 15084 expression. Additionally, compared to the 
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lincRNA-ENST000005 15084 rsl2325489T allele, the 
rsl2325489C allele was correlated with an increase in the 
proliferation rate of breast cancer cells in vitro and in vivo.The 
association between rsl2325489C>T polymorphism and breast 
cancer indicated the possibility that this genetic variant in 
lincRNA-ENST()00()05 15084 exon may be a common suscepti- 
bility factor for developing breast cancer. 

As one type of long non-coding RNA genes, lincRNAs are often 
tissue-specific mRNA-like transcripts lacking significant open 
reading frames [40,41]. In various human tissues, lincRNAs are 
emerging as new players, associated with development in a stage- 
specific manner and diseases including cancers. It is biologically 
reasonable that functional lincRNAs may play a role in the 
development of cancer. One famous example is the lincRNA 
GAS5; genetic aberrations at this lincRNA locus have been found 
in many types of tumors, including melanoma, breast, and prostate 
cancers [42-44]. These lines of evidence suggest a crucial role of 
lincRNA in tumorigenesis. Recently, a subset of breast cancer 
susceptibility loci (6q25.1, 16ql2.1, 10q21.2, and llq24.3) has 
been identified among Asian women through GWASs. Bioinfor- 
matics analysis revealed several lincRNAs mapped to these genetic 
susceptibility loci. Our results are consistent with previous findings 
[45,46], suggesting that dysregulation of lincRNA is associated 
with susceptibility to breast cancer in Chinese populations. 
Furthermore, cell proliferation experiments indicated that a subtle 
change in lincRNA-ENST000005 1 5084 function due to the 
rsl2325489C>T polymorphism may interfere with cell prolifer- 
ation. 

Cumulative evidence has demonstrated that numerous sets of 
lincRNAs themselves have distinct and critical biological functions 
in X-chromosome inactivation (Xist, Tsix) [47,48], imprinting 
(HI 9, Air) [49,50], regulation of gene expression (HOTAIR) [51], 
and reprogramming of human induced pluripotent stem cells [52], 
as indicated by gene expression patterns. A functional genomics 
approach and cell-based assays have demonstrated that specific 
lincRNAs could be transcriptionally regulated by key transcription 
factors in diverse biological processes [13]. In addition, recent 
studies have shown that some lincRNAs act as the precursor to 
microRNAs and are capable of regulatory function in response to 
cellular stress or oncogenic signals [53]. It is well recognized that 
non-coding RNAs play a regulatory role in several complex 
processes in the nucleus and cytoplasm [39,54]. This difference in 
localization of lincRNAs suggests diverse mechanisms of regula- 
tion and function of lincRNAs involved in the nucleus and 
cytoplasm. Approximately 30% of embryonic stem cell lincRNAs 
are implicated in regulation in the nucleus and can be associated 
with multiple regulatory complexes to affect neighboring regions 
[55]. Alternatively, IncRNAs can participate in RNA- RNA 
interactions to carry out their regulatory roles in the cytoplasm 
[56]. Recendy, it has been shown that the presence of a binding 
site for microRNA in the conserved site of a lincRNA gene may 
regulate lincRNA expression levels [34,57,58]. Our study showed 
that lincRNA-ENST000005 15084 was moderately more abun- 
dant in the cytoplasm than in the nucleus of fractionated breast 
cancer cells, suggesting that the function of this lincRNAs is 
exerted in the cytoplasm. Our results provided strong evidence 
supporting a hypothesis for cytoplasmic regulation, in which the 
lincRNA-ENST000005 15084 rsl2325489C>T SNP may affect 
the expression of this lincRNA by modifying the binding site for 
the miRNA-370. Our phenotypic experiment also demonstrated 
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